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Abstract 



We compute the pointwise asymptotics of orthogonal polynomials with respect to a general class of 
pure point measures supported on finite sets as both the number of nodes of the measure and also the 
degree of the orthogonal polynomials become large. The class of orthogonal polynomials we consider 
includes as special cases the Krawtchouk and Hahn classical discrete orthogonal polynomials, but is far 
more general. In particular, we consider nodes that are not necessarily equally spaced. The asymptotic 
results are given with error bound for all points in the complex plane except for a finite union of discs 
of arbitrarily small but fixed radii. These exceptional discs are the neighborhoods of the so-called band 
edges of the associated equilibrium measure. As applications, we prove universality results for correlation 
functions of a general class of discrete orthogonal polynomial ensembles, and in particular we deduce 
asymptotic formulae with error bound for certain statistics relevant in the random tiling of a hexagon 
with rhombus-shaped tiles. 

The discrete orthogonal polynomials are characterized in terms of a a Riemann-Hilbert problem 
formulated for a meromorphic matrix with certain pole conditions. By extending the methods of [17, 
22], we suggest a general and unifying approach to handle Riemann-Hilbert problems in the situation 
when poles of the unknown matrix are accumulating on some set in the asymptotic limit of interest. 

1 Introduction 

This announcement concerns asymptotic properties of polynomials that are orthogonal with respect to pure 
point measures supported on finite sets. Let N € N be fixed, and consider N distinct real nodes a;jv,o < 
xn,i < ■ ■ ■ < xn,n-i to be given; together the nodes make up the support of the pure point measures we 
consider. We use the notation Xjy := {xN,n}n~o f° r the support set. Along with nodes we are given positive 
weights iujv,0j wjv,i, ■ ■ • , Wn,n-i, which are the magnitudes of the point masses located at the corresponding 
nodes. The discrete orthogonal polynomials associated with this data are polynomials {pN,k{z)}^SQ where 
PN,k(z) is of degree exactly k with a positive leading coefficient and where 
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If PN,k(z) = Cft k z k + • • • + c^ k , then we denote by ~KN,k{z) the associated monic polynomial pN,k{z)/c^\. 
These polynomials exist and arc uniquely determined by the orthogonality conditions because the inner 
product associated with ([!]) is positive definite on span(l, z, z 2 , . . . , z N ~ l ) but is degenerate on larger spaces 
of polynomials. The polynomials PN,k(z) may be constructed from the monomials by a Gram-Schmidt 
process. A general reference for properties of orthogonal polynomials specific to the discrete case is the 
book of Nikiforov, Suslov, and Uvarov p3f . In contrast to the discrete orthogonal polynomials, we refer to 
the orthogonal polynomials with respect to an absolutely continuous measure as the continuous orthogonal 
polynomials. 

We use the notation Zjv for the set {0, 1, 2, . . . , N — 1}. Examples of classical discrete weights are [[j] 

• Krawtchouk weight: on the nodes xnj '■= (J + 1/2) /JV for j £ Z^v in the interval (0, 1), we define the 
weight 

• Hahn weight: on the infinite set of nodes arjyj := (j + l/2)/N for j = 0, 1, 2, . . . , we define the weight 

th a\ NN ^ r(6)r( c + j)r(d + j) 

«sjQ>, c, d) := ^ ■ T{j + mb + mc)m , (3) 

where 6, c, and <i are real parameters. Special cases are the following. First, taking d = a and 
b = 2 — N — j3, with a, (3 > 0, and taking the limit c — > 1 — iV, we obtain the weight 

(j + a-l\(N + 0-2- f 

N™- 1 \ i l\ N 
w^(a,(3) := • A J TW^J^ 

(3-1 

which is defined on the finite set of nodes Xn := (Zjv + l/2)/N. Second, taking c? = 2 — TV — /3 and 
& = a, with a, /3 > 0, and taking the limit c — ► 1 — N, we obtain the weight 



^ (a, /?) - r(j - 1)r(Q - j)r(Ar _ j)r{N + (i _ 1 _ ] y for ^ Z w , (5) 



which is again defined on the nodes Xn '■= (Zjv + 1/2) /N. 

< Remark: In Q| , the polynomials orthogonal with respect to the weight (^) are called the Hahn polynomials, 
and in Q, the same weight is called the Askey-Lesky weight. For the two special cases where the weight 
is supported on a finite set of nodes, we now adopt the terminology used by Johansson Jl6| |, and thus refer 
to the weight (|]) simply as the Hahn weight (corresponding to the Hahn polynomials) and we refer to the 
weight (|^) as the associated Hahn weight (corresponding to the associated Hahn polynomials). > 

Our goal is to establish the asymptotic behavior of the polynomials pM,k{z) or their monic counterparts 
TtN,k{z) in the limit of large degree, assuming certain asymptotic properties of the nodes and the weights. In 
particular, the number of nodes must necessarily increase to admit polynomials with arbitrarily large degree, 
and the weights we consider involve an exponential factor with exponent proportional to the number of nodes 
(such weights arc sometimes called varying weights) . We will obtain pointwisc asymptotics with precise error 
bound uniformly valid in the whole complex plane with the exception of certain arbitrarily small open discs. 
Our assumptions on the nodes and weights include as special cases all relevant classical discrete orthogonal 
polynomials, but are significantly more general; in particular, we will consider nodes that are not necessarily 
equally spaced. 
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1.1 Motivation and applications. 

In the context of approximation theory, there has been recent activity J^, ^| in the study of polynomials 
orthogonal on the real axis with respect to general continuous varying weights and the corresponding large 
degree pointwise asymptotics. One of important application of the results of || [| is the proof of several 
universality conjectures of random matrix theory. Thus, a natural question to ask is whether it is possible to 
extend the results of || ||] to handle discrete weights, and obtain similar universality results for the so-called 



discrete orthogonal polynomial ensembles (see § 3.1 ) 



Indeed, it has turned out recently that various problems of percolation models, random tiling, qucucing 
theory, non-intersecting paths and representation theory can be reformulated as asymptotic questions of 
discrete orthogonal polynomial ensembles with very concrete weights (see for example, |l4|, [H| || ). In these 
ensembles, the weights are all classical (Mcixner, Charlier, Krawtchouk or Hahn). Using integral formulae 
for the corresponding orthogonal polynomials, the relevant asymptotics have been analyzed except for the 
Hahn weight case. The weights handled in this paper include the Hahn weight (and also Krawtchouk), and 
hence as a corollary, we obtain new asymptotic results for the Hahn polynomials. As the discrete orthogonal 
polynomial ensemble for the Hahn weight arises in the statistical analysis of random rhombus tilings of a 



hexagon, our asymptotic results for Hahn polynomials yield new results on this problem (see § 3.2) 



1.2 Methodology. 

The method we use is the Riemann-Hilbcrt characterization of discrete orthogonal polynomials, and an 
adaption of the Dcift-Zhou method for the steepest-descent analysis of Riemann-Hilbcrt problems. The 
Riemann-Hilbert problem for discrete orthogonal polynomials has poles instead of usual jump conditions on 
a continuous contour, and the poles are accumulating in the limit of interest to form a continuum. 

There has been some recent progress jl7|, in the integrable systems literature concerning the problem of 
computing asymptotics for solutions of integrable nonlinear partial differential equations (e.g. the nonlinear 
Schrodinger equation) in the limit where the spectral data associated with the solution via the inverse- 
scattering transform is made up of a large number of discrete eigenvalues. Significantly, inverse-scattering 
theory also exploits much of the theory of matrix Riemann-Hilbert problems, and it turns out that the 
discrete eigenvalues appear as poles in the corresponding matrix-valued unknown. So, the methods recently 
developed in the context of inverse-scattering actually suggest a general scheme by means of which an 
accumulation of poles in the matrix unknown can be analyzed. 

In this paper, we extend the method of (l?], ^2) and suggest a general and unifying approach to handle 
Riemann-Hilbert problems for the situation when poles are accumulating. Especially we overcome the 
following two issues: 

(a) How to transform a Riemann-Hilbcrt problem with pole conditions to a Riemann-Hilbcrt problem with 
an analytic jump condition on a continuous contour so that a formal continuum limit of poles can be 
rigorously justified and the Dcift-Zhou method can be applied. 

(b) How to handle the upper constraint of the so-called equilibrium measure, and thus to correctly formulate 
an appropriate g-function. 

See § ^ for more information about these ideas. Full details will be given in the paper corresponding to this 
announcement . 

1.3 Basic assumptions. 

We state here precise assumptions on the nodes Xpj and weights {wN.j}- 
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1.3.1 Conditions on the nodes. 

1. The nodes lie in a bounded open interval (a, 6) and are distributed with a density p°(x). 

2. The density function p (x) is real analytic in a complex neighborhood of the closed interval [a, b], and 
satisfies: 

b 

o 



p u {x)dx = l, (6) 

J a 

and 

p°(x) > strictly, for all x G [a, b]. (7) 
3. The nodes are defined precisely in terms of the density function p°(x) by the quantization rule 

p °( x)dx= 2 -i±± (8) 



2iV 



for N G N and j G Zjv- 



1.3.2 Conditions on the weights. 

1. Without loss of generality we write the weights in the form 

N-l N-l 



n=0 n=0 

where the family of functions {Vn(x)} is apriori specified only at the nodes. 

2. We assume that for each sufficiently large N, Vjv(x) may be taken to be a real analytic function defined 
in a neighborhood G of the closed interval [a, b], and that 

VW(x) = ^(a;) + — + -p- (10) 
where V(x) is a fixed real analytic function defined in G, 7 is a constant, and 

limsupsup |t7at(z)| < 00 . (11) 

JV->oo 26G 



< Remark: In some applications it is desirable to generalize further by allowing 7 in (|1C|) to be a real 
analytic function in G with Y(z) not identically zero. It is possible to take into account such variation, but 
for simplicity we take 7 to be constant in this paper. For classical cases of Hahn and Krawtchouk weights, 
7 is indeed constant in scalings of interest. > 

The familiar examples of classical discrete orthogonal polynomials correspond to nodes that are equally 
spaced, say on (a, b) = (0, 1) (in which case we have p°(x) = 1). In this special case, the product factor on 
the right-hand side of (||) becomes simply 

W-l _ N N-! 

II \<*Nj - xnM' 1 - jKN _ j _ 1)[ ■ (!2) 

By Stirling's formula, taking the continuum limit of this factor (that is, considering N — > 00 with j/N — > x) 
shows that in these cases the formula @ leads to a continuous weight on (0, 1) of the form 

/ e -V(x) \N 

^ = U(i-^ j (i3) 
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up to an overall multiplicative constant. 

Our choice of the form (^|) for the weights is motivated by several specific examples of classical discrete 
orthogonal polynomials. The form (^) is sufficiently general for us to carry out useful calculations related to 
proofs of universality conjectures arising in statistical problems like the random rhombus tiling of a hexagon. 



1.3.3 Conditions on the equilibrium measure. 

There is an additional assumption, which is not as explicit as the previous two conditions. This assumption 
will be explained in the next subsection. 



1.4 The equilibrium energy problem and third assumption on the weights. 

It has been recognized for some time (sec |l9| and references therein) that, as in the continuous orthogonal 
polynomial cases, the asymptotic behavior of discrete orthogonal polynomials, in particular the distribution 
of zeros in (a, 6), is related to a constrained equilibrium problem for logarithmic potentials in a field <p{x) 
given by the formula 

tp{x) := V(x) + / log \x-y\p°{y)dy (14) 



for x € (a, b). We can also view tp(x) as being defined via a continuum limit: 

fix) = ~ Inn rr^- (15) 

iv— >oo iv 

where WN.j is expressed in terms of xm,j which in turn is identified with x. Thus at the moment we are 
working with the formal continuum limit of the weight Wnj. 

In the specific context of this paper, the field <p(x) is a real analytic function in the open interval (a, b) 
because V(x) and p°(x) are real analytic functions in a neighborhood of [a,b]. Unlike V(x) and p°(x), 
however, the field <p(x) does not extend analytically beyond the endpoints of (a, b) due to the condition (0). 

Given the parameter c G (0, 1), which has the interpretation of the ratio of the degree k of the polynomial 
of interest to the number N of nodes, and the field ip(x) as above, consider the quadratic functional 

b r b i i-b 



a J a 



E c \pt]:=c\ I log dn(x)dn(y) + / tp(x)dfi{x) (16) 



of Borel measures p on [a, b]. Let /i„ lin be the measure that minimizes E c {p] over the class of measures 
satisfying the upper and lower constraints 

0< I dfj,(x)<-[ p°{x)dx (17) 
JxeB c JxeB 

for all Borel sets B C [a, b], and the normalization condition 

rb 

dp(x) = 1 . (18) 



The existence of a unique minimizcr under the conditions enumerated in § 1.3.1 and § 1.3.2 follows from 
the Gauss-Frostman Theorem; see |24| for details. We will often refer to the minimizer as the equilibrium 
measure. It has been shown |]l9f that the equilibrium measure is the weak limit of the normalized counting 
measure of the zeros of pM,k{ z ) m the limit N — » oo with c = k/N fixed. 

That a variational problem plays a central role in asymptotic behavior is a familiar theme in the theory 
of orthogonal polynomials. The key new feature contributed by discreteness is the appearance of the upper 
constraint on the equilibrium measure (i.e. the upper bound in (|T7|)). The upper constraint can be traced 
to the following well-known fact. 



■5 



Proposition 1. Each discrete orthogonal polynomial PN,k(z) has k simple real zeros. All zeros lie in the 
range xn.o < z < xn,n-i and no more than one zero lies in the closed interval [xjv, n , aJjv,n+i] between any 
two consecutive nodes. 

Thus, the presence of the upper constraint proportional to the local density of nodes is necessary for the 
interpretation of the equilibrium measure as the weak limit of the normalized counting measure of zeros. 

The theory of the "doubly constrained" variational problem we are considering is well-established. In 
particular, the analytic properties we assume of V(x) and p°(x) turn out to be unnecessary for the mere 
existence of the minimizer. However, it has been shown [[l8] that analyticity of V(x) and p°(x) in a neigh- 
borhood of [a, b] guarantees that li% sin is continuously differentiable with respect to x e (a, b). Moreover, 
the derivative dp-^^/dx is piecewise analytic, with a finite number of points of nonanalyticity that may not 
occur at any x where both (strict) inequalities d^ in / dx(x) > and dp^ nin / dx(x) < p°(x)/c hold. We want 
to exploit these facts, which is why we have chosen to restrict attention to analytic functions V(x) and p°(x). 

For a method of computing the equilibrium measure from the coefficients in the three-term recurrence 
relation for a special class of discrete weights, see and reference therein. See also J5J for continuous 
weights. 

For simplicity of exposition we want to exclude certain nongeneric phenomena that may occur even under 
conditions of analyticity of V{x) and p°(x). Therefore we introduce the following assumptions. 

1.4.1 Third assumption; conditions on the equilibrium measure. 

Let T_ C [a, b] denote the closed set of x-values where dfi'j Ilin /dx(x) = 0. Let T C [a, b] denote the closed set 
of x-values where d/i'^ nin /dx(x) = p°(x)/c. 

1. Each connected component of T_ and T has a nonempty interior. Therefore T_ and T are both finite 
unions of closed intervals, where each closed interval that is part of the union contains more than one 
point. Note: this does not exclude the possibility that cither T_ or T might be empty. 

2. For each open subinterval / of (a, b) \ T_ U J- and each limit point zq € T_ of /, we have 

1 du c ■ 

lim -^(z) = K withO< J ftT<oo (19) 

x^z ,x£i ^j\ x — Zq | dx 

and for each limit point zq G T of /, we have 

1 



lim 



c dx 



K with < K < oo . (20) 



Therefore the derivative of the minimizing measure meets each constraint exactly like a square root. 

3. A constraint is active at each endpoint: {a, b} C T_ U T . 

It is difficult to translate these conditions on sufficient conditions on c, V{x), and p°(x). However, 

there is a sense in which they are satisfied gencrically. 

<i Remark: Relaxing the condition that a constraint should be active at each endpoint requires specific 
local analysis near these two points. We expect that a constraint being active at each endpoint is a generic 
phenomenon in the sense that the opposite situation occurs only for isolated values of c. We know this 
statement to be true in all relevant classical cases. For the Krawtchouk polynomials only the values c = p 
or c = q = 1 — p correspond to an equilibrium measure that is not constrained at both endpoints (see [fll[ ) . 
The situation is similar for the Hahn polynomials. i> 



G 



1.4.2 Voids, bands and saturated regions. 



Under the conditions enumerated in § 1.3.1, § 1.3.2, and § 1.4.1, the minimizcr n^un partitions (a, b) into three 
kinds of subintcrvals, a finite number of each, and each having a nonempty interior. There is a real constant 
E c , the Lagrange multiplier associated with the condition ([l8]); so that with the variational derivative defined 

as 

—^(x) := -2c / log \x - y\ dfi(y) + <p{x) , (21) 

SfJ- Ja 

we have when fi = /x^^ the following types of subintcrvals. 

Definition 1 (Voids). A void T is an open subinterval of [a,b] of maximal length in which /^^(x) = 0, 
and thus the minimizer realizes the lower constraint. For x £ 1 we have the strict inequality 

<)E 

-f^{x)>l c . (22) 

Definition 2 (Bands). A band I is an open subinterval of [a,b] of maximal length where fj,^ in (x) is a 
measure with a real analytic density satisfying < dpi^^/dx < p°(x)/c, and thus variations of the minimizer 
are free. For x € I we have the equilibrium condition 

^(x)=£ c . (23) 
op, 

Definition 3 (Saturated regions). A saturated region T is an open subinterval of [a, b] of maximal length 
in which dpf^^jdx = p°(x)/c, and thus the minimizer realizes the upper constraint. For x G T we have the 
strict inequality 

?p-(x)<l c . (24) 

on 

See Figure [l] for an example of the voids, bands and saturated regions associated with a hypothetical 
equilibrium measure. 

Voids and saturated regions will also be called gaps when it is not necessary to distinguish between these 
two types of intervals. The closure of the union of all subintcrvals of the three types defined above is the 
interval [a, b]. From condition 1 in § [l.4.l| above, bands cannot be adjacent to each other; a band that is not 
adjacent to an endpoint of [a, b] has on each side either a void or a saturated region. 

Some of our asymptotic results for the discrete orthogonal polynomials under the above assumptions are 
stated in § ||. In § |], the asymptotics of discrete orthogonal polynomials are applied to discrete orthogonal 
polynomial ensembles, and asymptotics of corresponding correlation functions are thus obtained. By special- 
izing to the Hahn ensemble, we arrive at specific results relevant in the problem of random rhombus tiling 
of a hexagon. Our new methods for the asymptotic analysis of general discrete orthogonal polynomials are 
discussed in § 0. 



2 Results: Pointwise Asymptotics of Orthogonal Polynomials 

As stated in the Introduction, we obtain pointwise asymptotics of the orthogonal polynomials pN,k(z) for z 
in the complex plane except for a finite union of discs of arbitrarily small but fixed radii in terms of a log 
transform of the equilibrium measure. These discs are centered at the edges of the bands. The difficulty at 
the edge of the bands will be explained below (see § |]). We hope to be able to handle the band edge problem 
in our future publication. 

We actually present here formulae for the monic polynomials ^^^(z), rather than the normalized poly- 
nomials pN,k{z). The asymptotics are given by different formulae in five different regions of the complex 
plane excluding small discs around the edges of the bands. These regions are (recall that [a, b] is the interval 
where the nodes are accumulating) 
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Figure 1: The hypothetical minimizer illustrated here partitions the interval (a,b) into voids (denoted V), 
bands (denoted B), and saturated regions (denoted S) as shown. 

(a) Outside the interval [a, b]. 

(b) Voids in [a, b]. 

(c) Bands in [a, b]. 

(d) Saturated regions in [0,6]. 

(e) Near the endpoints of [a, b] adjacent to a saturated region. 

The results for the first three cases are analogous to the corresponding results for the continuous orthogonal 
polynomials analyzed in ||. Also the asymptotic formula for 7Tjv,fc near the endpoints of [a, b] adjacent to 
a void is analogous to the corresponding asymptotics of continuous orthogonal polynomials whose weight is 
supported on a finite interval. The new cases that did not occur in the continuous orthogonal polynomial 
theory are the regions (d) and (e). In these regions, the discrete nature of the support of the weights is 
strongly present. We here present only the three regions (c), (d), (e). Asymptotic formulae for z in regions 
(a) and (b) will appear in the full version of this paper, together with the asymptotics of the leading coefficient 
c tv fe which completes the connection with the polynomials pN,k(z). 

For simplicity, we take the limit fc, N — + 00 while c = k/N is a fixed rational number. (Recall k is the 
degree of the orthogonal polynomial, and N is the number of nodes.) This means that if the fixed rational 
constant c is represented in lowest terms as c = p/q, then we arc taking k = Mp and TV = Mq for M G N. 
The more general case of k/N = c + 0(N^ 1 ) will be considered in a future publication. 

In the Theorems |l| and ^ below, the error bounds are different depending on the following two situations 
of the support of the equilibrium measure. 

• Case I: there is at least one void interval and at least one saturated region. 
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• Case II: there are only voids and bands, or only saturated regions and bands. 

We have obtained a better error bound for Case II than for Case I. Whether this is only a technical point, 
or whether this is in the very nature of discrete orthogonal polynomials is not clear yet. 

2.1 Bands. 



1.3. j § |o4 



Theorem 1 (Asymptotics of itN,k(z) m bands). Assume the conditions enumerated in 
and § \l-4-j . Let c = k/N be fixed. Then, uniformly for z in any fixed compact subinterval in the interior of 
a band I , 



KN,k(z) = exp f k J \og\z - x\ dtf nin (x) j ■ 2Ai(z) cos ( kn J dfi^ in (x) + $i(z) j +e N (z) 



(25) 



where 



e N {z) = < 



O 



O 



log(AQ 

N l/3 

log(N) 

TV 2 / 3 



Case I , 
Case II , 



(26) 



and Ai(z) > and <&i(z) are real functions defined in terms of a Riemann theta function associated to the 
hyperelliptic surface with cuts given by the bands. The functions Aj and $/ are uniformly bounded along 
with all derivatives. 

< Remark: The error estimates quoted above are derived from the asymptotic procedure we have devised 
for the case of node distributions associated with a general analytic density function p (x). In the special 
case of equally spaced nodes (when p°(x) is a constant function), we have recently found using a different 
procedure that the logarithmic term log(iV) in the error bound £n(z) can apparently be removed. At this 
time we do not know whether this alternate procedure can be modified for nonconstant p° (x) so as to remove 
the logarithm from the estimates in all cases. > 

2.2 Saturated regions. 

Theorem 2 (Asymptotics of HN.k{z) in saturated regions). Assume the conditions enumerated in 
and § Let c = k/N be fixed. Then, uniformly for z in a saturated region T, but 



§ 1-3.1, § 1.3. 



bounded away from any band edge points where the equilibrium measure becomes unconstrained by a fixed 
distance and from the endpoints a and b by a distance of size N~ 2 / 3 , we have 



KN,k{z) = exp [k log \z - x\ dp c min {x) 



(z) + £jy(z)) ■ 2 cos nN / p (x) dx + exponentially small 



(27) 



with £n{z) having the same error bound as in (|26|), where (f>r{z) is a real function defined in terms of a 
Riemann theta function, uniformly bounded along with all derivatives and having at most one zero in T. If the 
saturated region T is adjacent to an endpoint of [a, b], then 4>r{z) has no zeros in T. The exponentially small 
term is proportional to exp(iV [SE c /dp — £ c ]) where the variational derivative is evaluated on the equilibrium 



Since the zeros of the cosine function in ( p7| ) are exactly the nodes of orthogonalization making up the 
set Xn, and since the slope of the cosine is proportional to N at the nodes, we have the following. 
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Corollary 1 (Exponential confinement of zeros). Let J be a closed subinterval of a saturated region 
where the equilibrium measure achieves the upper constraint. Then the monic discrete orthogonal polynomial 
KN,k{z) has a zero uniformly close to each the nodes XN, n E XjyDJ, with the possible exception of one node. 

< Remark: The factor <pr{z) + £n{z) has at most one zero in V, and the zero can be present for some N 
and not others. Also, the zero generally moves about in a quasipcriodic manner as N is varied. So it seems 
that one should regard the situation in which this zero is exponentially close to one of the nodes (which form 
a set of measure zero in J) as being anomalous and quite rare. Therefore one should generally expect to see 
a zero exponentially close to each node in Xn H J. > 

Let if be a subinterval of the interval J that is the subject of Corollary |l| such that there is a zero of 
KN,k(z) exponentially close to each node in XnC\K (so according to the previous remark one expects that it is 
typically consistent to take K = J). The exponential confinement of the zeros in K has further consequences 
due to the rigidity of the zeros for general discrete orthogonal polynomials described in Proposition ^. A 
particular zero zq e K of 7rjv,fc(z), asymptotically exponentially localized near a node XN,n, can lie on one 
side or the other of the node. But if zq lies to the right of xn,„, then it follows from Proposition [| that the 
smallest zero greater than Zq must also lie to the right of xn,u+i and so on, all the way to the right endpoint 
of K . Likewise, if zq lies to the left of xn n , then all zeros in K less than zq also lie to the left of the nodes 
to which they are exponentially attracted. 

When we consider those zeros of iTN,k{z) that converge exponentially fast to the nodes (these are analogous 
to the Hurwitz zeros of the approximation theory literature, whereas the possible lone zero of </>r( 2 ) would 
be called a spurious zero), we therefore see that there can be at most one "dislocation" (i.e. a closed interval 
of the form [xn,u, aJjv,n+i] containing no Hurwitz zeros) in the pattern of zeros lying to one side or the other 
of the nodes. See Figure |[ 

|o---|o---|o---|o---|o---|o---|o---|o---|o---|o---|o---|o---|o---|o---|o--- 



-|o — |o — |o — |o — |o — |o- 



Figure 2: The Hurwitz zeros are exponentially close to the nodes of orthogonalization in saturated regions 
where the equilibrium measure achieves its upper constraint. Top: a pattern without any dislocation, where 
all Hurwitz zeros (pictured as circles) lie to the right of the nodes (vertical line segments) to which they are 
exponentially attracted. Bottom: a pattern with a dislocation. Middle: there may only be one dislocation, 
but it can move as parameters (e.g. c) are continuously varied and a Hurwitz zero passes through one of the 
nodes. 

< Remark: It should perhaps be mentioned that there is nothing that prevents a zero of ~KN,k{z) from 
coinciding exactly with one of the nodes x^j E Xjy. > 

Furthermore, due to Proposition ^, a spurious zero of 7Tjv,k(.z) in the subinterval K of the saturated region 
r can only occur if the pattern of Hurwitz zeros in K has a dislocation as in the bottom picture in Figure |^, 
in which case the spurious zero must lie in the closed interval [xjq.m XN.n+i] associated with the dislocation. 
Equivalently, the presence of a spurious zero in K C J C T indicates a dislocation in the pattern of Hurwitz 
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/From the analysis we have presented it is not clear whether the presence of a dislocation in the pattern 
of Hurwitz zeros implies that the function 4t{z) has a (spurious) zero in the corresponding closed interval 
[xjv.n; XN.n+i], or equivalcntly whether the absence of any zeros of 4>r(z) in a saturated region T means that 
the lone possible dislocation in the pattern of Hurwitz zeros is indeed absent. 



2.3 Near hard edges adjacent to a saturated region. 



Theorem 3 (Asymptotics of ~KN,k{z) near hard edges). At 



the conditions enumerated 



§ \l.3. 6 \ and § \l4-j Let c = k/N be fixed. If the upper constraint is achieved at the endpoint z 
uniformly for b — CN~ 2 / 3 < z <b, 



1.3.1 



b, then 



TTN,k( Z ) 



exp 




2 cos 




(28) 



exponentially small 



TV — > oo, and uniformly for b < z < b + CN 2 / 3 , 



itnA z ) = exp fc / log \z - x\ d/i^ in (a;) 



r(i/2 + a) 



N l/3 



(29) 



as N — > oo, where : = Np°(b)(z — b) and the function <fir(z) is same function that appears in (|27|). 
The function (j)° tLtsldc (z) is real-valued, and nonvanishing, and like 4>r(z) is constructed from Riemann theta 
functions and is along with all derivatives uniformly bounded in z as N — > oo. There are similar formulae 
near the endpoint z = a when the upper constraint is active there. The quantity r(l/2— £&) is the Euler gamma 
function, while the subscript V refers to the saturated region adjacent to the hard edge. The exponentially 
small term is proportional to exp(N[6E c /6fi — £ c ]) evaluated on the equilibrium measure. 

It is particularly interesting that the exponential attraction of the zeros to the nodes of orthogonalization 
in Xn, that we have seen is a feature of the asymptotics in subintcrvals of [a, b] where the upper constraint is 
achieved by the equilibrium measure, persists right up to the first and last nodes; in other words if the upper 
constraint is achieved at z = a then there is a zero exponentially close to x n,o an d if the upper constraint is 
achieved at z = b then there is a zero exponentially close to xn,n-i- 

More is true, however. From Proposition |l], we know that a zero z$ of 7rjv,fc(z) that is exponentially 
close to the first node xn,o must in fact satisfy the strict inequality zq > xn,o- Similarly, if a zero zo is 
exponentially close to the last node xn,n-i, then it must satisfy the strict inequality zo < xn,n-i- Going 
back to the discussion in § 2.2, we see that if there is a hard edge at an endpoint of [a, b], then in the 



saturated region adjacent to the hard edge there can be no dislocations in the pattern of Hurwitz zeros. This 
is consistent with the fact (see Theorem ^) that the function 4>r( z ) does not vanish in any saturated region 
adjacent to endpoints of [a, b] so that there is no spurious zero. 

<i Remark: The fact that the asymptotic formulae presented in Theorem || are in terms of the Euler 
gamma function is directly related to the discrete nature of the weights. In a sense, the poles of the functions 
r(l/2±£ a ) and r(l/2±£b) are "shadows" of the poles of the Riemann-Hilbert problem that we will discuss 
below in § H > 
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3 Applications 



3.1 Discrete orthogonal polynomial ensembles. 

Recall that Xj^ = {xN. n }n=o * s t ne se ^ °f n °des in (a, 6). In this section, we use the notation wn(x) for a 
weight on Xjy; to connect with our previous notation note simply that for a node x = xnj S -XjVj 

wn(x) = Wjvj ■ (30) 

Consider the joint probability distribution of finding k particles, say Pi, . . . , P&, at respective positions 
xi, . . . , Xk in Xn, to be given by the following expression: 

P(particle Pj lies at the site Xj, for j = 1, . . . , fc) = p^' fe '(a;i, . . . , Xfc) 

(31) 



r— (zi-Zj) 2 JJwjv(z.j)> 



l<t<J<k j = l 

(we are using the symbol P(event) to denote the probability of an event) where is a normalization 

constant (or partition function) chosen so that 

J2 pW( Xl ,...,x k ) = l. (32) 

admissablc configurations of Pi, . . . , P k 

Since the distribution function is symmetric in all Xj, we can consider the particles Pj to be either distin- 
guishable or indistinguishable, and only the normalization constant will depend on this choice (the meaning 
of "admissablc configurations" in (|32|) is different in the two cases). The statistical ensemble associated with 



the density function (31) is called a discrete orthogonal polynomial ensemble. 

Discrete orthogonal polynomial ensembles arise in a number of specific contexts (see for example, [fl4| , [Te| , 
p7i| . [|), with particular choices of the weight function iujv(-) related (in cases we are aware of) to classical 
discrete orthogonal polynomials. It is of some theoretical interest to determine properties of the ensembles 
that are more or less independent of the particular choice of weight function, at least within some class. 
Such properties are said to support the conjecture of universality within the class of weight functions under 
consideration. 

Some common properties of discrete orthogonal polynomial ensembles can be read off immediately from 
the formula For example, the presence of the Vandermonde factor means that the probability of 

finding two particles at the same site in Xn is zero. Thus a discrete orthogonal polynomial ensemble always 
describes an exclusion process. This phenomenon is the discrete analogue of the familiar level repulsion 
phenomenon in random matrix theory. Also, since the weights are associated with nodes, the interpretation 
is that configurations where particles are concentrated in sets of nodes where the weight is larger are more 
likely. 

The goal of this section is to establish asymptotic formulae for various statistics associated with the 



ensemble (31) for a general class of weights in the continuum limit N — > oo with the number of particles k 
chosen so that for some fixed rational c e (0, 1), we have k — cN. Note that the number of particles k will 
have the same role as the degree of orthogonal polynomials k. We use the same assumptions on the nodes and 



weights as in the rest of the paper (see § 1.3.1, § 1.3.2, and § |l.4.l|). The main idea is that, as is well-known 



the formulae for all relevant statistics of ensembles of the form ( |3l|) can be written explicitly in terms of the 
discrete orthogonal polynomials associated with the nodes Xjy and the weights Wnj = w^(xnj). 

To relate the statistics of interest to the discrete orthogonal polynomials, we first define the so-called 
reproducing kernel (Christoffel-Darboux kernel) 



fe-i 



K N , k (x,y) := \/w N (x)w N (y)y^^pN : n(x)p Ntn (y) , (33) 



n=0 
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for x, y in the nodes. Using the Christoffel-Darboux formula pq| . which holds for all orthogonal polynomials, 
even in the discrete case, the sum on the right telescopes: 

Sk-i) 



v , x / T\ TT C W,fc-l PN,k( x )PN,k-l{y) -PN,k-l(x)p N:k (y) 

K Nik {x,y) = ^w N (x)w N {y) — m '■ 

(34) 



sJwN(x)WN{y)- 



x-y 



Standard calculations (see, for example, |2l| or [g7|) of random matrix theory, in the case of so-called (3 = 2 
ensembles, yield the following exact formulae. The m-point correlation function defined for m < k by 

Rg*)( Xl ,... iXm ) :=—*!— V p^(x u ...,x k ), (35) 

(k — my. 

(x m + 1 ,...,X k )€X N 

where denotes the pth Cartesian power of Xn, can be expressed in terms of the discrete orthogonal 
polynomials by the formula 

R^' k \ Xl , ...,x m )= de.fAv.Air,.,-,,).. ir w . (36) 
For any set B C Xpj, the 1-point correlation function has the following interpretation: 

R^' 1 *' (x) = E(number of particles in B) , (37) 

x€B 



where E denotes the expected value. Similarly, the 2-point correlation function has the following interpreta- 
tion: 

R,2 N ' k \x, y) = E(number of (ordered) pairs of particles in B). (38) 

x,y£B 

Furthermore, the statistic defined for a set B c Xm and m < min(^B, k): 

A^' k ^ (B) := P(there are precisely m particles in the set B) (39) 

(this probability is automatically zero if m > j^B by exclusion) is well-known to be expressible by the exact 
formula 

1 / d n 



dt r 



det(l-tK N , k L), (40) 



t=i 

where ifjv,fc is the operator (in this case a finite matrix, since B is contained in the finite set Xn) acting in 
£ 2 (Xn) given by the kernel Kw t k(%, y), and KN,k\ B denotes the restriction of Km,u to £ 2 (B). 

This is by no means an exhaustive list of statistics that can be directly expressed in terms of the orthogonal 
polynomials associated with the (discrete) weight wn(')- For example, one may consider the fluctuations 
and in particular the variance of the number of particles in an interval B C X^. The continuum limit 
asymptotics for this statistic were computed in jl6| for the Krawtchouk ensemble (see Proposition 2.5 of 
that paper) with the result that the fluctuations are Gaussian; it would be of some interest to determine 
whether this is special property of the Krawtchouk ensemble, or a universal property of a large class of 
ensembles. Also, there are convenient formulae for statistics associated with the spacings between particles; 
the reader can find such formulae in section 5.6 of the book Q. 

Depending on the location of interest, we have different results. We distinguish again three regions: 
bands, voids and saturated regions. 
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3.1.1 In a band. 

Theorem 4 (Universality of the discrete sine kernel). For a node x £ lying in a band I, 

"TO^K 1 * ^))' (41) 

where the error is uniform in compact subsets. For distinct nodes x and y in I , 

K NAx ,y) = w °^ y (42) 

where 0(1) is uniform for x and y in a compact subsets. Also with a given node and for £ and rj such 

that 

X + AT Of VP 1 V G XN md X + AT Of \ V 1 V G XN ' ( 43 ) 

Np (x)K N ^(x,x) Np {x)K N ^ k (x,x) 

we have 

1 K ( . j g \ _ sin(7r - (g - n)) ( \og(N) \ 

K N , k (x,x) N ' k \ X+ Np°(x)K N!k (x, X y X+ Np°(x)K N , k (x,x)J n-^-rf) \ N ) l( > 

where the error is uniform for x in a compact subset of the band I, and £ and n in a compact set o/R. 
< Remark: Let V'min = ^Mmm/ ^ x - By the same analysis, we have the same limit for 

Ax) :^Jx + ls -^ t x +Tri ^) (45) 



with the same error bound. > 

< Remark: We believe that the logarithmic term log(iV) in the error can be replaced by 1 whenever the 
nodes are equally spaced (c/. the remark immediately following Theorem^]), and it may be the case that 
such an improved estimate holds more generally. In any case, the improved factor of 1/N compared to either 
1/N 1 / 3 or 1/N 2 / 3 as one might expect from the form of the error term En(z) in Theorems |l| and [| is due to 
the particular structure of the kernel Kjsi, k . Operators with this special type of kernel are called integrable 
operators (see jl3| and &). > 

Let the operator S x act on £ 2 (Z) with the kernel 

S x (i,j) = V 9 7 '-, (46) 

7T- (1-3) 

Recall the formula (fh)]) for Am ,k ^ (B) and its interpretation (^) as a probability. 

Theorem 5 (Asymptotics of local occupation probabilities). Let C Xjs; be a set of M nodes of 
the form 

B N = {x Nt3 ,XN, 3 +k 1 ,XN,j+k 2 , ■ ■ ■ , x N,]+k M -i} (47) 
where #i?v = M is independent of N , and where 

< k\ < k 2 < ■ ■ ■ < fcjw-i all in Z (48) 

are also all independent of N . Set Mjy := {0, ki, k%, . . . , kjn—i} C Z. Suppose also that as N — > oo, 
xnj = minBjv — > x with x lying in a band (and hence the same holds for a;jv,j+fe M -i = maxS^,). Then, as 
N — ► oo, 

det (l - tK N , k \ BN ) = det (l - tS x \ MN ) + O , (49) 



for t in a compact set in C, and 



Am ,k) {B N ) = — 
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3.1.2 In voids and saturated regions. 

Theorem 6 (Exponential asymptotics of the one-point function in voids). Let T be a void interval. 
For each compact subset F of T, there is a constant Kp > such that 

K N , k (x,x) = 0{e- KpN ) as N -► oo (51) 

holds for all nodes x G X^ n F . Also, for distinct nodes x,y in n F we have 

Qt -K F N\ 

K Nik {x,y) = ^ 1. (52) 

x-y 

Thus, the one-point function is exponentially small in void intervals as TV — > oo, going to zero with a 
decay rate that is determined by the size of SE c /5fi — £ c at the node x. 

Theorem 7 (Exponential asymptotics of the one-point function in saturated regions). Let T be 

a saturated region. For each compact subset F of T, there is a constant Kp > such that 

K N , k {x, x) = 1 + 0(e- KpN ) as N -> oo (53) 
holds for all nodes x G Xn n F. Also, for distinct nodes x, y in Xn n F we have 

K N , k (x,y) = ^ 1. (54) 

x-y 

Therefore the one-point function is exponentially close to one in saturated regions. 
3.2 Random rhombus tiling of a hexagon. 

Let a, 6, c be positive integers, and consider a hexagon with sides of lengths that proceed in counter-clockwise 
order, 6, a, c, 6, a, c. All interior angles of this hexagon are equal and measure 2tt/3 radians. We call this an 
abc-hexagon. See Figure |^ for an example of an a6c-hexagon. We denote by C the lattice points indicated 
in Figure [| By definition, C includes the points on the sides (Pq,P\), {P\,P2), (P2>Pz), and {Pz,Pa), but 
excludes the points on the sides (P^Ps) and (P5,Pg). 

Consider tiling the a6c-hexagon with rhombi having sides of unit length. Such rhombi come in three 
different types (orientations) that we refer to as type I, type II, and type III; see Figure ^. Rhombi of types 
I and II are sometimes collectively called horizontal rhombi, while rhombi of type III are sometimes called 
vertical rhombi. The "position" of each rhombus tile in the hexagon is a specific lattice point in C defined 
as indicated in Figure [I| See Figure || for an example of a rhombus tiling. 

MacMahon's formula pH] gives the total number of all possible rhombus tilings of the a6c-hexagon as the 
expression 

a b c.j 1 

: :: <»> 

Consider the set of all rhombus tilings equipped with uniform probability. It is of some current interest to 
determine the behavior of various corresponding statistics of this ensemble in the limit as a, 6, c — > oo. 
In the scaling limit of n — > oo where 

a = an , b — fin , c = ^n , (56) 

with fixed a,fi,"f > 0, the regions near the six corners are "frozen" or "polar zones", while the inside of the 
hexagon is "temperate". Cohn, Larsen and Propp [H showed that in such a limit, the expected shape of the 
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Figure 3: The abc-hexagon with vertices Pi, ■ ■ ■ , Pq, and the lattice C 

Type I Type II Type III 

Figure 4: The three types of rhombi; the position of each rhombus is indicated with a dot. 

boundary of the frozen regions is given by the inscribed ellipse. Moreover, the same authors also computed 
the expected number of vertical rhombi in an arbitrary set U £ M 2 . However, this calculation was provided 
without specific error bounds. In |lq| , Johansson proved a large deviation result for the boundary shape, and 
also proved weak convergence of the marginal probability of finding, say, a vertical tile near a given location 
in a temperate region. The same paper also contains an investigation of the related Aztec diamond tiling 
model and a proof that the fluctuation of the boundary in this model is governed (in a proper scaling limit) 
by the so-called Tracy- Widom law for large random matrices from the Gaussian unitary ensemble p6| . The 
same is expected to be true for rhombus tilings of hexagons, but this is still open. 

In |]l6|| , Johansson expresses the induced probability for a given configuration of vertical or horizontal 
rhombi on a given sublattice in terms of discrete orthogonal polynomial ensembles with Hahn or associated 
Hahn weights. Even though the Hahn weight is a classical weight, the relevant asymptotics for Hahn 
polynomials have not been previously established. However, the asymptotics of the previous sections may 
now be applied to the special case of the Hahn polynomials, and this yields new results for the asymptotic 
properties of the hexagon tiling problem (see Theorems ^ and ^ below). 

We first state the relation between hexagon tiling and discrete orthogonal polynomial ensembles. We 
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Figure 5: A rhombus tiling of the abc-hexagon, and the lattice C m when m = 3; holes are represented by 
white dots and particles are represented by black dots. 

will assume without loss of generality that a > b (by the symmetry of the hexagon, the case when a < b is 
completely analogous). Consider the m th vertical line of the lattice L counted from the left. We denote by 
C m the intersection of this line and C. In a given tiling, the points in C m correspond to positions (in the 
sense defined above) of a number of rhombi of types I, II, and III. We call the positions of horizontal rhombi 
the particles, and the positions of vertical rhombi the holes. See Figure]^ for an example of C m when m = 3, 
illustrating the corresponding particles and holes. 

The uniform probability distribution on the ensemble of tilings induces the probability distribution for 
finding particles and holes at particular locations in the one-dimensional finite lattice C m . A surprising 
result is due to Johansson |l(| which states that the induced probability distribution functions for holes 
and particles are both discrete orthogonal polynomial ensembles with Halm and associated Hahn weight 
functions respectively (see (^]) and ^). 

Let Q m be the lowest point in the sublattice C m . On the sublattice £ m , there are always c particles, 
and L m holes. We set 7,„ = c + L rn — 1. Now, let x\ < ■ ■ ■ < x c , where Xj S {0, 1,2,..., 7, n }, denote the 
(ordered) distances of the particles in C m from Q m , and let £i < ■ ■ ■ < £,L m , where £j G {0, 1, 2, ... , 7™}, 
denote the distances of the holes in C m from Q m . In particular, we then have {x\, . . . , x c } U {£1, . . . , £l„,} = 
{0, 1, 2, • • ■ , 7 m }. Let P m {x\, . . . , x c ) denote the probability of finding the particle configuration x%, ■ ■ ■ , x c , 
and let P TO (£i, . . . ,£z, m ) denote the probability of finding the hole configuration £1, • • • ,£i, m . 



Proposition 2 (Theorem 4.1 of Jig])- Let a,b, c>l be given integers with a >b. Set a m := \a — m\ and 

b m '■= \b — m\. Then 

1 c 
P m (xi,...,x c ) = -~- Y[ (xj ~ x k ) 2 Y[w(xj) , (57) 

Zm l<j<k<c j=l 



17 



where Z m is the normalization constant (partition function), and where the weight function is the associated 
Hahn weight 

»(») := <T(a m + 1, b m + 1) = n]{am + n y, {N _ n C mN _ n _ l + bm)l , ^ 
for a certain constant C . Also, 

i 

p m (a,...,a„j = — n &-&) 2 II w &)< ( 59 ) 

m l<j<fc<L m j=l 

where Z m is the normalization constant, and where the weight function is the Hahn weight 

i \ Halm/ , t h L1 \ {n + a m )\{N- n- 1 + 6 m )! 

: = ^w,n ( a m + !. h m + 1) = n!(iV-n- 1)! ' ^ ' 

for a certain constant C . 
With 

m = rn , (61) 



for 7 > 0, the scaling (|56[) is precisely the same scaling that we analyzed in § 3A , Also we can explicitly 
compute the equilibrium measure for Hahn and associated Hahn using either the result of [^9|, or solving 
the variational problem as in 0, which will appear in the full version of this paper. The calculations of 
the equilibrium measure and the one-point correlation function imply that as n —> oo, the one-dimensional 
lattice £ m , after rescaling to finite size independent of n, consists of three disjoint intervals: one band, 
surrounded by two gaps (either saturated regions or voids, depending on parameters). The saturated regions 
and voids correspond to the frozen regions or polar zones, while the central band is a temperate region. 
Hence in particular, the endpoints the band when considered as functions of r determine the typical shape 
of the boundary between the polar and temperate zones of the rescaled a&c-hcxagon. Also the one-point 
function converges pointwise except at the band edges, or at the boundary, to the equilibrium measure. 
This was conjectured in []l6| (including the edges), in which weak convergence was obtained. Moreover, our 
computation of the one-point correlation function provides the relevant error bounds, when we consider sets 
U contained in a single line C m . One expects that with additional analysis of the same formulae it should be 
possible to show that the error is locally uniform with respect to r, in which case the same bounds should 
hold for more general regions U e R 2 . We state our result in this direction as follows. 

Theorem 8 (Strong asymptotics with explicit error bounds). On the line C m , where m = rn and 
t is fixed as n — > oo, the scaled holes £j/n lying in the polar zones, uniformly bounded away from the 
rescaled expected boundary between the polar and temperate zones, have a one-point function asymptotically 
convergent to either 1 (in the polar zones near the vertices Pi and P§), or to (in the polar zones near 
the vertices P\, P3, P4, and Pq), with an exponential rate of convergence of the order 0(e~ Kn ) for some 
constant K > 0. The one-point correlation function for the scaled holes l;/n in the temperate zone converges 
to the corresponding equilibrium measure with an error of the order 0(l/n), which is uniformly valid away 
from the rescaled expected boundary between the polar and temperate zones. 

< Remark: We give the above error estimate as 0(l/n) rather than 0(log(n)/n) because the Hahn and 
associated Hahn polynomials are orthogonal on a set of nodes X n that are equally spaced. > 

In the temperate zone, in addition to the one-point function, which is the marginal distribution, we can 
control all fc-point correlation functions under proper scaling. One such consequence is the following theorem 
on the scaling limit for the locations of the holes. 
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Theorem 9 (Discrete sine kernel correlations). Let x > be rational such that nx G Zjv and such that 
nx is in the temperate zone away from the expected boundary between the polar and temperate zones with 
uniform order in n . Let B m = {nx, nx + ji , nx + , • ■ ■ ^nx + ju}, and set B = {0,ji,j2, • • • ,Jm}- Then 



If d p 

lim Y(there are precisely p holes in the set B rn ) = — — - — 

n->oo p\ \ dtP 



det (1 - ftS| B ) , (62) 



where S acts on £ 2 (Z) with the kernel 

fid * = ™ 

for some constant c(x) . 



sm{c(x)(i - j)) 

b{i,]) = T- ^ ' ( 63 ) 



< Remark: All of the results we have written down for holes have analogous statements in terms of 
particles using the duality relation between the Hahn and associated Hahn weights that will be explained in 

> 



4.2 



< Remark: Once one obtains the asymptotics near the band edge of the equilibrium measure for discrete 
orthogonal polynomial ensembles, fluctuation statistics of the boundary curve will be computable. It is 
conjectured in |]l6f that the limiting law at the band edge is the Tracy- Widom distribution known from the 
Gaussian unitary ensemble of random matrix theory. > 



4 Riemann-Hilbert Problems for Discrete Orthogonal Polynomi- 
als 

In this section, we discuss the main ideas of asymptotic analysis of discrete orthogonal polynomials via a 
Riemann-Hilbert problem. 

4.1 The fundamental Riemann-Hilbert problem. 

We first introduce the Riemann-Hilbert problem characterization of discrete orthogonal polynomials. For 
k G Z consider the matrix P(z;7V, k) solving the following problem, which is a discrete version of the 
analogous problem for continuous weights first used in fl2| . 

Riemann-Hilbert Problem 1. Find a 2x2 matrix P(z;N,k) with the following properties: 

1. Analyticity: P(z; N, k) is an analytic function of z for z G C \ Xm- 

2. Normalization: As z — > oo, 



3. Singularities: At each node xnj, the first column of P is analytic and the second column ofP has a 
simple pole, where the residue satisfies the condition 

Res P( Z ;N,k) = lim P( Z ;N,k) ( ° ) = (° W " J *f NJ ''%% ) (65) 

z=x Ntj z^x N ,j \ (J J \ WN,j-T2l{XN,j, Jy, k) J 

forj = 0,...,N-l. 
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Proposition 3. Riemann-Hilbert Problem^ has a unique solution when < k < N — 1. In this case, 



P(z;N,k) 



if k > and 



( 



KN,k{z) 



„(k-l) 



P(z;iV,0) 



V 



JV-l 

E 

J'=0 



Z — XN,j 



i x N,j) \ 



v,k-i(z) 

3=0 



N-l (fe-1) / 1 



2 ^ XN.j 



N-l 



i E 



W N ,j 

z — xn.. 



1 / 



(66) 



(67) 



We analyze this Riemann-Hilbert problem asymptotically as N, k — ► oo by adapting the Dcift-Zhou 
procedure developed in [l0[ | and subsequent work. Due to the conditions on the pole and the separation 
of the zeros of discrete orthogonal polynomials, we have two difficulties which we mentioned in § 1.2. In 



the following two sections, we describe the main techniques we developed to overcome these difficulties. A 
complete asymptotic analysis of the above Riemann-Hilbert problem will appear in the full version of this 
paper. 

In order to apply the usual Dcift-Zhou method, we will transform the above Riemann-Hilbert problem 
into a Riemann-Hilbert problem with jump conditions on continuous contours: a transformation from P i— > 
R. For this new Riemann-Hilbert problem, the formal limit of accumulation of nodes can be rigorously 
justified. However, in addition to the continuum limit of iV — > oo [N being the number of nodes), we 
simultaneously take the large degree limit k — ► oo. In the analysis of [§l 0, a method for this limit is to 
conjugate the Riemann-Hilbert problem with the so-called g-function that is defined as a log transform of the 
equilibrium measure. It was crucial in the analysis of J^, |j| for the continuous orthogonal polynomials that 
the equilibrium measure has only a lower constraint. Actually the lower constraint yields an exponentially 
decaying factor. Hence the upper constraint condition ( |l7j ) of the equilibrium measure for discrete orthogonal 
polynomials generates an exponentially growing factor. In order to replace an exponentially growing term 
with an exponentially decaying term, we introduce another transformation before we map P to R: we will 
introduce an intermediate Riemann-Hilbert problem for Q so that P i— ► Q i— > R as an exact sequence of 
transformations. 



In S 4.2, we discuss the transformation P 



Q of reversing the triangularity of residue 
matrices, that will eventually work in our favor turning exponentially growing terms into exponentially 
decaying terms. In § 4.3, we discuss the transformation Q i— > R from a Riemann-Hilbert problem with 



residue conditions to a Riemann-Hilbert problem with jumps on continuous contours. 



4.2 Selectively reversing triangularity of residue matrices. 

Riemann-Hilbert Problem ^ involves residue matrices that are upper-triangular. It will be advantageous 
in general to modify the matrix P(z;iV, k) in order to arrive at a new Riemann-Hilbert problem in which 
we have selectively reversed the triangularity of the residue matrices near certain individual nodes xnj- 
Let A C Zjy where Zjv := {0, 1, 2, . . . , N — 1} and denote the number of elements in A by #A and the 
complementary set Z^r \ A by V. We will reverse the triangularity for those nodes x^.j for which j e A. 
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Consider the matrix Q(z; N, k) related to the solution P(z; N, k) of Riemann Hilbert Problem [j] as follows: 

/ Uiz-XN.n)- 1 \ 



Q(z;N,k) :=P(z;N,k) 



\{z~ XN,n) 
.neA 



P(z;N,k) 



neA 



\ 



\{Z~ X Nj „) 



neA 



It is direct to check that the matrix Q(z; N, k) is, for k G Zjv, the unique solution of the following Ricmann- 
Hilbcrt problem. 

Riemann-Hilbert Problem 2. Given a subset A o/Zjy of cardinality #A, find a 2 x 2 matrix Q(z; iV, fc) 
wif/i £/ie following properties: 

1. Analyticity: Q(z; iV, fc) zs an analytic function of z for z£C \ -Xjv- 

2. Normalization; As z — > oo, 



Q(z;iV,fc) 



y #A-k Q 

z fc -# A 



= 1 + 



(69) 



3. Singularities: At each node xnj, the matrix Q has a simple pole. If j € V where V := Zjy \ A, i/ien 
t/ie first column is analytic at xjy.j and the pole is in the second column such that the residue satisfies 
the condition 



Res Q(z;N,k)= lim Q(z;N,k) 

Z—XN.j Z—*XN,j 



I w NJ Y[ (xnj ~ x Ntn ) 2 ^ 

neA 



V o 







(70) 



/ 



for j £ V. If j € A, then the second column is analytic at xnj and the pole is in the first column such 
that the residue satisfies the condition 



( 



Res Q(z;N,k)= lim Q(z;7V,fc) 

z—xnj z — >xnj 







w N .. 



\ (xnj ~ x N . n ) 2 



(71) 



/ 



for j S A. 



Note that the (21)-entry of the residue matrix in ( |7l| ) is the reciprocal of the (12)-entry of the residue 
matrix in (|7C|). When we make the choice that A contains the saturated regions, the effect will be to turn 
exponentially growing factors into exponentially decaying factors. Let us be more specific about how we 
choose A. In each band 1^ lying between a void and a saturated region we choose a point yk, and "quantize" 
these to the lattice Xn by associating with each point a sequence {y/c,Jv}?/ = o converging to yf. as N — > oo 
with elements given by 



rVk,N 

N l -° 



p (x) dx 



N / p v (x)dx 



(72) 



where \u~\ denotes the least integer greater than or equal to u. Thus yk,N li es asymptotically halfway between 
two consecutive nodes. For each N, these points are the common endpoints of two complementary systems 
of subintcrvals of (a, b). We denote the union of open subintcrvals delineated by these points and containing 
no saturated regions by S^ 7 . The complementary system of subintervals contains no voids and is denoted by 
E£. See Figure § 
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Figure 6: A schematic diagram showing the relation of the minimizer /i^ in (a;) to the interval systems Sq 7 
and S^. The nodes XN,j G (a,b) are indicated on the x-axis with triangles; their density is proportional 
to the upper constraint. The common endpoints of subintervals of Sq 7 and converge as N — > oo to the 
points yk indicated on the x-axis. 

Based on this partitioning of (a, 6), the specific choice we make is that A is the set of indices j £ Zjy 
such that XN,j G E^. 

< Remark: After the completion of this work, we learned that an analogous transformation was used in 
H for a somewhat different asymptotic analysis of a special choice of discrete orthogonal polynomials (Askey- 
Lesky weights, or general Hahn weights). In H), the choice of the region A to reverse the triangularity was 
made using intuition from representation theory. (We thank A. Borodin for bringing this to our attention.) 
For the limit of interest in this paper, we determine the region A in order to reverse the triangularity in 
saturated regions of the equilibrium measure while preserving triangularity in all voids. > 

4.2.1 Dual families of discrete orthogonal polynomials. 

The relation between Riemann-Hilbert Problem |l| and Riemann-Hilbert Problem || gives rise in a special 
case to a remarkable duality between pairs of weights {wnj} defined on the same set of nodes and their 
corresponding families of discrete orthogonal polynomials that comes up in applications. Given nodes 
and weights {wjf j}, take A = Zjv and let 

P(z;N,k) := aiCl(z;N,k)a 1 , where k := N — k . (73) 

Thus, we are reversing the triangularity at all of the nodes, and swapping rows and columns of the resulting 
matrix. It is easy to check that P(z;N, k) satisfies 

P(z;N,k)l Z Q k j Ul + O^j as z oo (74) 
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and is a matrix with simple poles in the second column at all nodes, such that 

Wjvj 



Res P(z;N,k) = lim P(z;N,k)[ JVJ (75) 



holds for j G Z^v, where the "dual weights" {wnj} are defined by the identity 

JV-l 

w N .jW N .j JJ (xjvj - a;jv,™) 2 = 1 ■ (76) 

n=0 

Comparing with Riemann-Hilbert Problem |l| we see that Pn(z; N, k) is the monic orthogonal polynomial 
tt n -^(z) of degree k associated with the dual weights {wpj.j}- In this sense, families of discrete orthogonal 
polynomials always come in dual pairs. An explicit relation between the dual polynomials comes from the 
representation of P(z; N, k) given by Proposition ^|: 

n N ,k(z) = Pn(z;N,k) 

N-l 

= P 2 2(z;N,k)Y[{z-x N , n ) 

n=0 (77) 

N-l 2 N-l 

= ^W N j c'fi'f, 1 ^ TTN,k-l{xN,j) JJ {z ~ XN,n) ■ 
j=0 n=0 

Since the left-hand side is a monic polynomial of degree k = N — k and the right-hand side is apparently a 
polynomial of degree N — 1, equation ([77]) furnishes k relations among the weights and the normalization 
constants c^\. 

In particular, if we evaluate ( |77j ) for z = xjy.z for some I G Zjv, then only one term from the sum on the 
right-hand side survives and we find 

2 N ~ 1 

W N,k( X N,l) = c N^k-l w N,lJ\(XN,l-X N , n )-7T N ,k-l(xN,l), (78) 

n=0 

an identity relating values of each discrete orthogonal polynomial and a corresponding dual polynomial at 
any given node. The identity ( ]7q ) has also been derived by Borodin g|. 

< Remark: We want to point out that the notion of duality described here is different from that explained 
in |23|| . The latter generally involves relationships between families of discrete orthogonal polynomials with 
two different sets of nodes of orthogonalization. For example, the Hahn polynomials are orthogonal on a 
lattice of equally spaced points, and the polynomials dual to the Hahn polynomials by the scheme of p3| ] are 
orthogonal on a quadratic lattice for which xn,u — xn,u-i is proportional to n. However, the polynomials 
dual to the Hahn polynomials under the scheme described above are the associated Hahn polynomials, which 
are orthogonal on the same equally-spaced nodes as are the Hahn polynomials themselves. The notion of 
duality we use in this paper coincides with that described in Q and is also equivalent to the "hole/particle 
transformation" considered by Johansson |r| . > 

4.3 Removal of poles in favor of jumps on contours. 

The deformations in this section are based on similar ones first introduced by one of the authors in p2[ . Let 
the analytic functions P±(z) be given by 

P±{z) := ±iexp (^finN p°(s)ds^J . (79) 
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Note that by definition, (3 + (xM,j) = /3-{%N,j) = ( — 1) W 1 J for all iV £ N and j € Z^r. Consider the contour 
E illustrated in Figure 0. ^From the solution of Ricmann-Hilbcrt Problem || we define a new matrix R(z) 



E* 


El 


E- 


EY 


E? 




/ "+\ 


/ n + \ 


/ Vl\ \ 




yA 




VA \/ 


vv \ 


E A 




A u A 


A 


A 


■^0 




\" V / 


\ " A / 






E^ 


eY 


E* 


eY 


E* 



6 



Figure 7: TVie contour E consists of the subintervals E^ and E^ as in Figure [(] and associated contour 
segments EY and E^ in i/ie upper half-plane, and EY and E^ in ifte lower half-plane. The enclosed regions 
f2j and f2± are a/so indicated. The contour E /ies entirely in the region of analyticity ofV(x) and p°(x). 
All components of E are taken to be oriented from left to right. 



as follows. Set 



/ 



R(z) := Q(z;N,k) 



~[(z-x N>j ) \ 



1 -P ± (z)e- NV »W^ 



Y[(z-x N>j ) 

iev 



for z£fl v 



± • 



(80) 



R(z) := Q(z;iV,fc) 



\ 



V 



Y[(z-x Ntj ) 

-f3 ± ( z )e NV »W^ 1 

[[(z-x N j) 



for ze!lj, 



(81) 



and for all other z set R(z) := Q(z; N, k). 

The matrix R(z) is, for arbitrary N € N and A: S Zjv, the unique solution of the following Ricmann- 
Hilbcrt problem. 

Riemann-Hilbert Problem 3. Find a 2 x 2 matrix R(z) wi£/i ine following properties: 

1. Analyticity: R(z) is an analytic function of z for z G C\ E. 

2. Normalization: z — > oo, 



R(z) 











[ - 



(82) 



3. Jump Conditions: R(z) takes continuous boundary values on E from each connected component of 
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\ S. Denoting the boundary values taken on the left (right) by R + (z) (R_(z)J, we have 

( ~[(z-x NJ ) \ 
1 ±f3 ± ( z )e- NV ^^ 



R+(z) = R_(z) 



Y[{z-x N ,j) 



V o 

for « 6 , 
/ 



(83) 



R+(z) = R_(z) 



\ 



±/ 3 ± (z) e ^«M^ 1 

V J'GA / 



(84) 



for z <E £± , 

1 (i3_(,)-^( Z ))e^Wf- 

iev 

V o 1 



R+(z) = R_(z) 



(85) 



/or z <E S^ 7 , and, 



/ 



R+(z) = R_(z) 



\ 



Y[(z-x NJ ) 

(P-(z)-P + (z))e NV ^^ 1 

V J6A / 



(86) 



for z £ . 

Note that all off- diagonal entries of the jump matrices are analytic nonvanishing functions on their 
respective contours. 

The significance of passing from Riemann-Hilbcrt Problem || to Riemann-Hilbert Problem || is that all 
poles have completely disappeared from the problem. All boundary values of R(z) and the corresponding 
jump matrices relating them are analytic functions. This means that Riemann-Hilbert Problem [}] is suffi- 
ciently similar to that introduced in for the continuous weight case that it may, in principle, be analyzed 
by methods like those used in [g, The main obstruction at this point is that the off-diagonal elements 
of the jump matrices for R(z) are not exactly of the form e NW ( z ) for some W{z). This is a consequence 
of the fact that the sequence of transformations P i— > Q i— ► R is exact, and one may observe at this point 
that the desired form e NW ^ can be achieved by carefully taking a natural continuum limit based on the 
assumptions on the nodes and weights set out at the beginning of this announcement. In other words, while 
the jump matrix for R does not have the desired form, one may introduce an approximate Riemann-Hilbert 
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problem for a matrix R(z) for which the jump matrix indeed has the desired form; part of the analysis then 
becomes the task of showing that R(z) and R(z) are "close". 

< Remark: The difficulty that prevents us from obtaining asymptotic results near the band edges can be 
traced back precisely to the fact that the off-diagonal elements of the jump matrices for R(z) are not exactly 
of the form g NW ^ z >, but only approximately so. In the local analysis, the (small) discrepancy between the 
approximate and the exact form prevents us from obtaining the necessary asymptotics up to the boundary 
of the contours. We hope to be able to overcome this problem in a future work. > 

The final aspect of our analysis that we would like to briefly describe is our choice of a g-function 
with which we stabilize the (approximate) Riemann-Hilbert problem for R(z). We introduce a new matrix 
unknown by the transformation S(z) := R(z)e^ A_fc ^ 9 ^) ' 3 where the ^-function is given by 

g(z) = / log(z - x)p(x) dx (87) 



with density determined differently in the two types of intervals Sq 7 and S A : 



d dx 



p{x) : 

c — d 

Here of course is the equilibrium measure, c — k/N, and 



d:= f p°(x)dx = ^. (89) 

With this choice of g(z), in conjunction with the choice of the set A described above, the jump matrices for 
S(z) are precisely of the type for which the steepest-descent factorization technique can be applied. Also, 
S(z) is now normalized to the identity matrix for large z; the power asymptotics have been removed. The 
complete details of the subsequent analysis, including rigorous error estimates, will appear in the full version 
of the paper corresponding to this announcement. 
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